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Abstract— In this study, we collected 141 coding sequences of 
MERS-CoV from the National Center for Biotechnology and 
Information (NCBI), including the sequences isolated in Korea 
in 2015. We conducted the phylogenetic analysis using the 
Maximum Likelihood method to examine the overall variation 
patterns among the target sequences, and then, we divided the 
target sequences into 4 different groups according to occurred 
countries and host species. Using the codon analyzer named 
SimFluVar program, we analyzed the codon variation patterns 
in the wobble position among 4 groups. In order to investigate 
the effect of codon variations that can change the phenotype of 
target genes, we compared the transversional substitutions 
between the Korean-origin sequences and other groups. As a 
result, we found that the Korea-origin sequences showed very 
minor differences with those collected from the Saudi Arabia in 
2015, whereas other groups which were collected from USA and 
UK in 2013 and 2014 showed more complicated differences. We 
also compared the Korea-origin sequences with those of camel- 
origin sequences, and we found that the substitution pattern was 
somewhat different with that of human-origin viruses. 
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L 


Since the first case of the Middle East respiratory 
coronavirus (MERS-CoV) was reported in 2012, this novel 
virus has been spread among the Middle-Eastern countries 
such as Egypt, Oman, Qatar, and Saudi Arabia. The route of 
transmission is not clearly understood, but, according to the 
phylogenetic studies, MERS-CoV showed the high 
similarities with the camel-origin MERS-CoV. In May 20, 
2015, MERS-CoV was transmitted to Korea through a person 
who visited the Arabian Peninsula. Because of the high 
pathogenicity and infectivity, the MERS-CoV which was 
collected in Korea has been doubted as a novel variant of 
MERS-CoVs. In this study, we determined the outline of the 
variation patterns among 141 MERS-CoVs sequences using 
phylogenetic analysis method, and then, performed more 
intensive analysis using the codon analyzer named SimFluVar 


(http://Icbbsnu.ac.kr/simfluvar) to identify the minor, but 
might be important, differences among them. 


INTRODUCTION 


978-1-4673-6799-8/15/$31.00 ©2015 IEEE 


1718 


Insung Ahn 
Department of Big Data Science, 
University of Science & Technology, 
Daejeon, Korea 
e-mail: isahn@ust.ac.kr 


IL MATERIALS & METHODS 


A. Sequences 


In the present study, 141 nucleotide sequences (coding 
sequences) of MERS-CoVs were collected from the National 
Center for Biotechnology and Information in the United States. 
We used the ‘ORF la-ORF lan-S-ORF3-ORF4a-ORF5-E-M- 
N-ORF8b’ region of MERS-CoVs whose nucleotide length 
was 4062 base pairs. The data used spanned over a period of 
4 years, from 2012 to 2015, including the sequences occurred 
in Korea. 


B. Phylogenetic analysis 


For phylogenetic analysis, multiple sequence alignment 
was performed on the collated sequences using the ClustalW 
program (gap open penalty = 40, gap extension penalty = 0.8) 
[1]. Using the aligned sequences that included the gaps 
generated from multiple sequence alignment, phylogenetic 
correlation was performed using MEGA (version 6.06) 
program [2] to create the maximum-likelihood tree (circular 
cladogram) based on Tamura-Nei substitution model [3]. 


C. Comparison of codon variation patterns 


To ascertain the codon substitution pattern of MERS- 
CoVs, we extracted 25 sequences which can represent the 
minor differences among 141 MERS-CoVs. We divided the 
target sequences into 4 different groups according to their 
epidemiological characteristics. Analysis was performed 
using the SimFluVar — (http://Icbb.snu.ac.k1/simfluvar) 
program, which enabled sequence analysis between different 
genome groups. The SimFluVar is an analytical tool for 
calculating the codon substitution patterns of influenza virus. 
Designed to compare a large number of nucleotide sequences, 
SimFluVar provides precise patterns of codon variations 
between two viral groups, especially for the influenza virus. 
SimFluVar also provides the useful functions, such as editing 
and visualization of the result matrix [4]. 


L RESULTS 


In the early stage of our analysis, we performed the 
phylogenetic analysis to identify the approximate differences 
among MERS-CoVs. This is the traditional method 
commonly used worldwide when the disease outbreak occurs. 

Here, we suggest another method which can detect the 
minor, but might be important, patterns among the very 
similar viral sequences using the SimFluVar program. Using 
this software, we found that the Korea-origin sequences 
showed very minor differences with those collected from the 
Saudi Arabia in 2015, whereas other groups which were 
collected from USA and UK in 2013 and 2014 showed more 
complicated differences. We also compared the Korea-origin 
sequences with those of camel-origin sequences, and we 
found that the substitution pattern was somewhat different 
with that of human-origin viruses. 

In conclusion, codon variation analysis, especially the 
transversional substitution analysis, can be used as a good 
method to find a minor, but possibly important, difference 
among similar viral groups. 
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